Cantor sets in the line: 
scaling function and the smoothness of the shiftmap 

by F.Przytycki and F. Tangerman 

Abstract: Consider d disjoint closed subintervals of the unit interval and consider an 
orientation preserving expanding map which maps each of these subintervals to the whole 
unit interval. The set of points where all iterates of this expanding map are defined is a 
Cantor set. Associated to the construction of this Cantor set is the scaling function which 
records the infinitely deep geometry of this Cantor set. This scaling function is an invariant 
of C 1 conjugation. We solve the inverse problem posed by Dennis Sullivan: given a scaling 
function, determine the maximal possible smoothness of any expanding map which produces 
it. 

Consider the space = { 1, .., d with its standard shiftmap a 

cr(aia2..) = («2---) 
Denote by a" 1 the d right-inverse of cr: 

(otiot2..) = (iaia 2 ..) 

Our convention will be not to use separating comma's in strings of symbols. 

Srf with the product topology is a Cantor set. Consider an embedding h of the space 
Erf = { 1, .., d into 1R with the standard order: 

h(a) > h((3) iff a m > (3 m 

where m is the first integer for which a m ^ (3 m . The image of h is also a Cantor set. Denote 
by / the induced shiftmap on the image of h and by f^ 1 the d right-inverses of /. Let 
r > 1. We say that h is C r if each of the right-inverses ff 1 have C r extensions to M which 
are contractions. We say then that the Cantor set is C r . 

Every C 1+e Cantor set has a scaling function, defined below and there is a simple char- 
acterization of those functions which are scaling functions for some C 1+e Cantor set. In this 
paper we describe those scaling functions which actually have to C k+e realizations. Here k 
is any integer greater or equal to 1 and < e < 1. We follow the convention that e = 1 
means a Lipschitz condition. 

The theory for r = 1 + e is essentially due to Feigenbaum and Sullivan who introduced 
the scaling function. It is defined in the following manner. Given an embedding h, then 
the shiftmap allows a canonical definition of the image of h as an intersection of nested 
collections of intervals. More precisely, define for any finite sequence (ji-.jn) Iji..j n as 

the 

convex hull of h({ a : a\ = j n ,..,a n = j\ }) Note the order in which the indices occur. 
Then for any jo, Ij Q j 1 ..j n C Ij 1 ..j n and the shiftmap maps Ij 1 ..j n to Ij 1 ..j n _ 1 - For the empty 
string, / denotes the image of h. The sets thus constructed are not intervals, but actually 
small pieces of the image of h. It is however convenient to think of them as intervals. 

For any subset J in the reals denote by < J > its convex hull and by \J\ the length of 
its convex hull. We will in the remainder always assume that < / > is the unit interval [0,1]. 



Denote the set of finite strings ji, ..,j n of length n by . The scaling function (ratio 
geometry) at level n is a function S n : 

S : Egf - (0, l) 2 ^ 1 

defined in the following manner. For each j\..j n S(j\..j n ) records the geometrical location of 
the d intervals { Ij j 1 ..j n }j =i..d in Ij\..j n by the ratio's of lengths of these d intervals (first d 
coordinates) and d — 1 gaps (last d — 1 coordinates) to the length of Ij 1: ..j n - In particular 
for jo = 1, --,d the jo — th coordinate of S is given by the following formula: 



S(ji..jn)j = tj 



Ij0»jn\ 



n-3n I 

The sum of all ratio's of lengths equals one. Therefore S actually takes values in the 2d — 2 
dimensional simplex Simp2d~2 of (0, l) 2d_1 where the sum of the coordinates equals I. 
Moreover lengths of intervals are determined by the scaling functions at all levels: 

l J ji..jJ = n-kS(jk+l~jn)j k (1) 

Consider two finite sequences j = j\..j n and j' = ji--j' m . There is a canonical identification 
between Ij and If defined as follows. Let j fl j' be the longest string which agrees with 
both the beginning of j and the beginning of j'. Then suitable iterates of the shiftmap map 
Ij to Ijnf respectively If to ij n j/.(see diagram) 

/ \ 
I f Ij 

The fundamental observation is that if the embedding is C 1+e then the identification map 
is close to being linear in the following precise sense. Define the nonlinearity of a diffeomor- 
phism / on an interval as 

, Df{x) 
log sup 

x,y,x^y U J\V) 

Then the nonlinearity of the identification map can be estimated from above in terms of the 
length of the intermediary interval Ijnf- Bvrt then if j fl j' is long (i.e. |ijnj'l small), the 
subdivision of Ij is close to that of Ij'. One concludes that there exists a uniform 7 such 
that < 7 < 1 

\S(ji..j n ) - S(j[..j'J\ < 7» (jnj,) (inequality 1) (2) 

Here (j(j fl j') denotes the length of j fl j'. Therefore for any infinite sequence j = (jij2---) 
the scaling function S: 

S U) = „l™ S{ji-jn) 
is well defined and has a Holder modulus of continuity: 

\S(j) - S(f)\ < 7 « (jn/) 
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This scaling function is canonically denned on the dual Cantor set £^ ua , whose elements 
are infinite sequences (jij2--)- Each such sequences should be thought of as a prescribed 
sequence of inverse branches of the shiftmap. 

Say that a map is C 1+ if it is C 1 + e for some e. 



Theorem: [Sullivan] Every C 1+ embedding has a Holder continuous scaling function. 
The scaling function is a C 1 invariant. Every Holder continuous function on the dual Cantor 
set with values in Simp2d-2 is the scaling function of a C 1+ embedding. 

Here the Holder continuity of the scaling function is defined with respect to a metric on 

Y^dual. 
^d ■ 

Ps(j, f) = exp(-5 tt(j n f) 

In the theorem S (the metric on Y,f al ) is not specified so we cannot specify e. 

The problem which remained was to understand which functions occur as scaling func- 
tions for C 1+1 and higher smoothness. Here we give necessary and sufficient conditions for 
a function S to arise as a scaling function for a C k + e (k positive integer and < e < 1) 
embedding. The main observation is that given an embedding, we should be able to extend 
the identification map between Ij and ly to their convex hulls < Ij > and < If to be C k+t 
close to affine provided j fl j' is long. Here close to affine is measured after affmely rescaling 
< Ij > and < Ij/ > to the unit interval. We refer to the process of changing the map by 
rescaling domain and range to the unit interval as renormalization. 



We will first characterize those functions which are scaling functions of C 1+e Cantor sets. 
This is a special case of the main theorem. We state it seperately because of its simpler form. 
Given a function S : E^ a/ — > Simpid-2- We replace an arbitrary metric ps on J^f 10,1 with 
a metric p$ so that for an embedding with S as scaling function there exits K so that for 



every j, j': 

- < i^4v < K (3) 
K ~ Psti, j') ~ 1 ^ 

This metric is defined as: 

PSUJ') = SUpn£~f (jnj ] S(j t +ljt+2-jnW) jt 
w 

(3) holds by (1) because any infinite tail w changes the product by a uniformly bounded 
factor (by (2)). 

Theorem 1: Fix < e < 1. The following are equivalent: 

1. There exists a C 1+e embedding with scaling function S. 

2. S is C e on {Hf al , p s ). (Here C 1 means Lipschitz). 

Proof: That 1. =>• 2. follows when one observes that a stronger form of (2) holds: 



\S{h-3n) ~ S{j[..j' n )\ < K\I jnf \ e (4) 
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This inequality carries over to the scaling function. Next apply (3). 

That 2. =>- 1., i.e. the construction of a C 1+e Cantor set will be done in the proof of 
the Main Theorem. 

□ 



Example 1: For every < e\ < £2 < 1 there exists S admitting a C 1+ei embedding but 
not C 1+e2 . We find it as follows: Fon an arbitrary < v < £2 2 £1 we can easily find a function 
S to Simp2d-2 which is C 1+ei+u but not C 1+e2 ~ v on with a standard metric p$, 5 > 

\ogd. We can find in fact S so that for every j e S^^, % — 1, .., d \ —\ogS(j)i/5 — 1| < v jt2- 
This is chosen so that S is C ei but not C 62 with respect to the metric p$- 



We now turn to the more intricate case of higher smoothness. 

Let A\ and A2 be two subsets of the unit interval / = [0, 1] such that both sets contain 
the endpoints of / and both have equal cardinality. Denote the k — th derivative operator 
by D k and denote by D k {A\ ) A2) the space of C k diffeomorphisms on / which map A\ to 
A2. For every constant M > consider the space of C7 fc -diffeomorphisms: 

Di r (M)(A!, A 2 ) = {0 e D k (A 1 ,A 2 ) : sup \D k <p(x) - D k <p(y)\ < M} 



Lemma: Assume that A\ and A2 consist of 2d points. Assume that k < 2 d. Then for 
each f , g in D k ar (M)(Ai, A2) we have for all integers t < k: 

sup\D l f - D l g\ < M 

Proof: Consider two such maps / and g. Their difference vanishes on A\ . Since 2 d > k, 
there exists (mean value theorem) for each t a point xt in / for which: 

rffixt) - rfgixt) = 

The lemma follows by induction and integration. D 

Given a function S as above and a point j in T^ al . Consider S(j). It encodes a partition 
of I in 2d — 1 intervals. Denote by A(j) the 2d end points of these intervals. Consider any 
jo = 1, ••, d and consider the point joj in T^ al . Then S(joj) specifies how the jo — th interval 
in j is subdivided. Consider two points j and j' in Y^ al Every element in D k (A(j), A(j')) 
maps the jo — th interval in the domain to the jo — th interval in the range, which we again 
can renormalize. This defines a map (restrict to jo — th interval and renormalize): 

R j0 : D k (A(j), AW)) - £ fc ({0, 1}, {0, 1}) 
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Main Theorem: Suppose k < 2 d. Suppose that we are given a function S as above. 
The following are equivalent. 

1. There exists a C k+e embedding with scaling function S. 

2. There exists a constant C so that for all j and j' in T l d l ual and all jo = \,..,d: 

D k var (C(j 03 , joj'))(AVoj),AVoj')) n R j0 (D k V ar(C(j, j'))(A(j), A{j')) + (b 
where for all j' G Y^ al , 

C(j,f) = C psV, j')"*- 1 



Discussion of statement of theorem: The statement of the theorem may appear ob- 
scure. We briefly discuss in an informal manner how the scaling function records smoothness 
beyond C 1 . 

1) Consider two strings j and j' and the identification map between Ij and ly. The 
scalings S(j) and S(j') record how 2c? specific points in Ij map to 2d specific points in Iji. 
Consider the renormalized identification map , and assume that we know that the variation 
of the k th derivative of this identification map is small. Consider any k + 1 of the 2 d 
specific points. Since we know where these points map, we can compute a value of the k th 
derivative (just as the standard mean value theorem computes a value of the first derivative 
given 2 points and their values). Because the variation of the k — th derivative is small , we 
obtain combinatorial relations between any two choices of k + 1 points. Condition 2. of the 
theorem captures this idea. It omits attempts to describe the derivative algebraically 

2) In fact we do not need all 2d points which appear in the definition of the ratio 
geometry to be involved in the definition of -D^ ar 's, k+1 would be enough (see Lemma). In 
particular for C 1+€ the condition 2. makes impression we do not need the geometry at all. 
However then the condition (4) in Prof of Theorem 1 is hidden in 2. . Without (4) a map 
/ — > / in D^ ar (A(joj), A(joj')), even linear, after renormalizing by RJ^ may happen not to 
be extendible to a map belonging to the second D in 2. . 

3) The condition of the main theorem seems to imply that high smoothness is not 
discussed when d is small. We can however replace d by any positive power d n in the 
following manner. E^ is canonically homeomorphic to S^n, by the homeomorphism which 
groups the digits of a point in E^ in groups of n digits. This homeomorphism conjugates the 
n — th iterate of the shiftmap on E^ to the shiftmap on E^n. 

Proof of Main Theorem: We first show that 1 implies 2.. Assume that we are given 
a C k+e embedding h. Denote the induced shiftmap on the image by /. We may assume 
that its d right-inverses extend as C k + e contractions to the unit interval, the convex hull of 
the image of h. Denote by fji\j the identification between < Ij > and < Iji > and denote by 
Fji\j the renormalized identification defined on the unit interval J. Then fj'\j, respectively 
Fj'\j, factors as a composition: 

fj'\j = fj'\3'r\j ° fj'nj\j 

F j'\j = F f\fnj ° F fnj\j 
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Since ff\fr\j :< Ij'nj >^< If > is a composition of C k+e contractions the derivatives of 
fj'U'nj are controlled by the first derivative. 

More precisely, by a standard computation which we leave to the reader, there exists a 
constant C so that for all j and f, all 1 < t < k + e 

\ff\fnj\t < C\ff\fnj\i 

Here \.\t denotes the sup norm of the t — th derivative for t integer and the a- Holder norm 
of the n — th derivative if t = n + a, < a < 1. 

But then: 



\t 



\ F j'\j'nj\t - ,, | \ff\fnj 



r 

< \Ifr\j\ 1 1 C 



The last inequality follows because: 

I T -, I 

Dff lfnj (x) 



w 

\Ifnj\ 



for some point x G Ij'nj an d the bounded nonlinearity of the maps. 

Now let j and j' be two distinct points in Y l j ual . Denote by j n , respectively j' n the 
beginning strings of length n. Then for n large enough j fl j' = j n fl j' n and the sequence 
of maps {Fj' n \fnj} ^ s C k + e -equicontinuous. Since moreover: 

F f n+m \f^ = F f n+m \& ° F A\fnj 

this sequence of maps is in fact C k+e convergent. Denote by -^/j/nj ^ ne limit map. By the 
same argument Fff n j is defined. Therefore: the limiting map: 

F f\j = F f\fnj ° F f~fnj 

is well-defined and C k+e and therefore in D k ar . Since ps(j, j') is uniformly comparable to 
| ij'nj I we obtain that this limiting map Ff\j in D k ar (C') for some uniform constant C'. 
Since moreover: 

Rjo F j'\j = F joj'\joj 
we automatically have an element in the intersection. 2. now follows. 



We next show that 2. implies 1.. Since S is given, we first construct an embedding of 
the Cantor set with S as scaling function. We then show that this embedding is C k + e . 

Fix an arbitrary infinite word w. Construct a Cantor set C in the unit interval < / > by 
consecutively subdividing any interval < Ij > according to S(jw). We obtain an embedding 
with scaling function S. Denote the induced shiftmap on the image by /o- It is defined on 
a Cantor set C. In order to show that this shiftmap has a C k+e extension, we verify the 
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assumptions to Whitney's extension theorem [Stein]. We will construct functions fi,----fk 
on C so that for all x, y in C and I = 0, k (Whitney conditions): 

My) = £ j^rjy ft-iWiv - x ) l ~ l + o(\ y - x\ k - l+ <) 

These functions fi, ..fk play the role of the first k derivatives of J'q. 

The interval < / > is subdivided in d intervals < l{ >, i = l,..,d. On each of the 
intervals, fo maps U = Cfl < 1% > to / = C n J by /o. Now fix a i = 1, ..<i. We will 
work on each < h> separately. For each t = 1, .., k define ft on Jj as: 

ft = /////„ . x { /Vo,,..,,,., }:/;...,„: 

Here 4>j 1 ..j n ,i '■ Jj x ..j n i Jji..j„ is an y ma P whose renormalization is in 

D k aT ( c (h"3niw, j 1 ..j n w)(A(j 1 ..j n iw), A(ii..j n w)) 

We need to see that ft is in fact well-defined on the Cantor set. We first verify that ft 
is defined point wise on the Cantor set. Consider a string j\..j n and an element jo- For 
x G Ij ji..j n i, consider <fij ji..j n i(x) and 4>ji..j n i(x) and their t — th derivatives. Then by 
assumption 2. and the Lemma: 

\ Dt( t>kh-jni{ x ) - Dt( Pji..jni( x )\ < 

l-Ljojl-jnl 



C\I(ji..j n i] 



\k+e-t 



Therefore we obtain the convergence on the Cantor set in fact exponentially fast. 

We need to check that the Whitney conditions hold on the Cantor set. Let x and y be 
distinct points in the Cantor set in < ij >. Consider the first time that they wind up in 
different intervals in the subdivision: 

Then again by 2.: 

t=k ^ 

\<t>h..jni(y) - o :h . .;„,;{■>■) - £ - D l <\> h . 4ni {x){y - xf \ < C \x - y\ k+e 

t=o l - 

(and similar for the higher derivatives) where C is a uniform constant. Since \D t ipj 1 ^ n i{x) — 
ft{x)\ <C\x- y\ k+i -\ 

we can take limits and obtain the Whitney conditions for the family fo, fi, fk- Conse- 
quently there exists a C k+e extension of / to each Jj and we have produced a C k+e embedding 
of Erf with scaling function S. □ 



We say that two embeddings h\ and hi are C r equivalent if the composition hi o h\ 
admits an extension as a C^-diffeomorphism to M. It is well known that if h\ and hi are C r 
equivalent then the composition hio h^ 1 in fact admits an extension as a C r -diffeomorphism. 
This result can also be deduced as a corollary of the method employed in the main theorem. 

Corollary: Assume that hi and hi are equivalent C k + € embeddings: hi o h^ 1 is C 1 . 
Then hioh^ 1 is C k+e . 

Proof: To show that the conjugacy hio h^ 1 has a C k+e extension, it suffices to construct 
its higher derivatives on the Cantor set and apply the Whitney extension theorem. This can 
be achieved using the same manner as that employed in the second half of the proof of the 
main theorem. Both embeddings have the same scaling function S so , as the embeddings are 
(jk+i ^ ne ra tj geometries on finite levels are close to one another in the sense of condition 
2. of Main Theorem. D 

Remark: The preceding theorem is not totally satisfactory, since we do not understand 
how to extract C fc -smoothness (k integer!) from the scaling function. This is because in the 
previous scheme everything which needs to be controlled is dominated by geometric series. 
More refined finite smoothness categories like c l+zygmund can however be treated in much 
the same way. 



We finally show in an example that conditions 2. of the main theorem can be explicitly 
checked, by constructing for every k, d, e with k < 2 d — 1 and < e < 1 an example of 
a scaling function with a C k + e realization and none of higher degree of smoothness. 

Example: Let Ji = [f^rf , f^ry], i = 1, »,d 
Define / : U; J; -> J = [0, 1] as: 

f(x) = A{(2d - l)x + x k+e ), x G Ji 

while / is affine on each Jj, % > 2. Here the the constant A is chosen so that f{J\) = J. 

Of course the resulting Cantor set is C k+e . We will show that its scalingfunction on the 
dual Cantor set has no C k + ei realization for all e\ > e, by explicitly checking that condition 
2. of the main theorem does not hold for k + t\. 

Let w be any element in Y^ al which does not contain the symbol I. Denote by l n the 
string of length n consisting of l's only: 



Consider the infinite strings j = l n w and j' = l n lw = l n +iw. Consider the subdivision 
A(j), respectively A(j'), of the unit interval dictated by S(j) and S(j'). Let $ n be any map in 
D k ar (A(j), A(j') for which its renormalized restriction is in fact in D k ar (A(lj), A(lj'). 
We will bound the variation of the k — th derivative of $ n from below and conclude that 
condition 2. of the main theorem is not satisfied with k + e\. 

We denote by A(j) m the m — th point from the left in A(j). Because k < 2d — 1 there 
exists x G [A(j)i, A(j)2d] such that: 

D k F flj (x) = D k ® n {x) 



8 



Recall that Fj/y is the renormalization of fji\j for the map / defined above. See the notation 
of the proof of the Main Theorem. Similarly there exists y G ..,A(lj)2d-i] so that: 

D k F im {y) = D k (R^ n )(y) 

. We have that: 

D k F f]j (x) = B 2d - l- n(k - 1+e K e 

(note that (2d — l)~ n ~ Ps(j',j))- The map Fij/^j is just the renormalization of the 
restriction of the limit map Fj/y to the left most interval L4(j)i, A(j)2] in the unit interval. 
Let y' be the point in the interval [A(j)\, A(j)2\ 1 corresponding to y after rescaling the unit 
interval back to [A(j)\, A (7)2]. Then we have that: 

D k (F flj )(y') = B (2d - l)-< k - 1+ ^ (y'Y 

where B is a computable constant. 

But \x — y'\ > const (2d — l) 2 . Consequently: 

D k $ n (x) - D k <$> n (y') = const (2d - i)-«(*-i+<0 ^ _ 

and is comparable to: 

Ps(j',j) k - 1 + e 

i.e. the variation of D k Q n is at least on the order of: p s (j',j) kl + e - 
Since: 

r PsU',j) k - 1 + ei n 

urn — — — -, — ^— — = 

n ^°° Ps(j',j) k ~ 1 + e 

condition 2. of the theorem can not be satisfied for 



C(j',j) = CpsU'J) 



-1 + ei 



Reference: 
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